Rank | Count | Beginning |
---|---|---|
477 | 2929 | A |
3639 | 942 | Az |
5863 | 174 | Ez |
5156 | 84 | Egy |
9564 | 71 | Története |
6148 | 66 | Ezt |
8277 | 55 | Mivel |
5343 | 49 | Ekkor |
5672 | 46 | Ennek |
5970 | 46 | Ezek |
5420 | 44 | Élete |
6217 | 44 | Ezután |
7297 | 43 | Később |
5071 | 39 | Ebben |
6603 | 39 | Ha |
8490 | 39 | Nem |
7346 | 34 | Két |
8244 | 33 | Miután |
5582 | 32 | Első |
6894 | 32 | Így |
7031 | 30 | Itt |
5128 | 28 | E |
6344 | 28 | Fekvése |
6055 | 26 | Ezen |
8657 | 25 | Ő |
9507 | 25 | Több |
2360 | 24 | Amikor |
8162 | 24 | Minden |
4686 | 23 | Bár |
6287 | 23 | Ezzel |
In the next four subsections show the most frequent sentence beginnings consisting of N words, N=1, 2, 3, 4. In this subsection we start with N=1.
The most frequent word-N-grams at the beginning of sentences give some insight into sentence composition.
Especially for N=1, we only need a small corpus to identify the most frequent sentence beginnings.
select substring_index(sentence, ' ', 1) as beg, count(*) as cnt from sentences group by substring_index(sentence, ' ', 1) order by cnt desc limit 50;
4.3.1.2 Most Frequent Sentence Beginnings II
4.3.1.3 Most Frequent Sentence Beginnings III
4.3.1.4 Most Frequent Sentence Beginnings IV
4.3.1.1 Most Frequent Sentence Endings I
4.3.1.2 Most Frequent Sentence Endings II
4.3.1.3 Most Frequent Sentence Endings III
4.3.1.4 Most Frequent Sentence Endings IV